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FOREWORD 


This  report  is  a  preprint  of  a  paper  with  the  same  title  which  is 
scheduled  to  appear  in  the  September  1982,  issue  of  the  IEEE  Trans¬ 
actions  on  Information  Theory  (vol.  IT-28).  This  is  the  second  part  of 
a  two-part  study,  the  first  part  of  which  is  published  as  CSL  Report  No. 
R-931. 
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I.  Introduction  and  Preliminaries 


As  in  Pert  I  of  this  study  [I],  we  consider  the  signel  detection 
problem  described  by  the  following  pair  of  statistical  hypotheses  concerning 
a  random  process  {x^j  i  “  1,2,...,*}  from  which  we  have  a  sequence 
(x^;  i  -  1,2,..., n}  of  observations: 

H0;  \  \  ;  i  -  1,2, ...,n 

versus 

H.:  X±  »  +  9  ;  i  -  l,2,...,n  (1) 

where  i  -  1,2,...,*}  is  a  stationary  noise  process  and  6  is  a  known 
positive  signal.  Part  I  of  this  study  [1]  considered  the  problem  of 
designing  asymptotically  efficient  detection  systems  for  the  problem  of  (1) 
in  which  the  noise  process  exhibits  a  weak  moving-average  type  of  dependence. 
Specifically,  [1]  considered  the  noise  model 

Ni  "  *  Yi-1  +  Yi  +  P  Yi+1  >  (2) 

where  [Y^;  i  €  Z}  is  an  Independent  and  identically  distributed  (i.i.d.) 
noise-generating  sequence  with  marginal  probability  density  function  (p.d.f.)p, 
and  where  p  is  a  fixed  parameter  indexing  the  degree  of  dependence  among  the 
noise  samples.  It  was  shown  in  [1]  that  an  appropriate  detector  for  this 
situation  with  |p|  small  is  of  the  form  of  the  corresponding  optimum 
independent-noise  (p  ■  0)  system  with  an  added  linear  correction  factor 
depending  on  the  degree  of  dependence  as  parameterized  by  p  and  on  the 
particular  noise  statistics  as  determined  by  p.  It  was  also  seen  in  [1] 
that  appreciable  improvement  over  the  corresponding  independent-noise  system 
can  often  be  expected  from  the  proposed  system,  particularly  for  channels 
dominated  by  impulsive  noise. 


-  .  •  k  ’  1  ’  '•-«* ,  *>  .  \  *-*.»•  *  w*i.  -  . 


The  specification  of  Che  detection  system  proposed  in  [1]  requires  e 
complete  knowledge  of  the  statistics  of  the  noise  process  (n^j  i  -  1,2,...,*} 
as  determined  by  p  and  p.  An  important  practical  modification  of  this 
problem  is  the  situation  in  which  the  noise  distribution  is  not  known 
exactly  but  rather  is  known  to  be  in  some  class  of  possible  noise 
distributions  representing  uncertainty  in  a  nominal  noise  model.  In  this 
situation  a  detector  is  desired  whose  performance  does  not  deteriorate 
drastically  over  the  class  of  possible  noise  statistics.  A  detection  system 
with  this  general  property  is  usually  known  as  a  robust  detector.  Several 
Investigators  have  considered  the  problem  of  robust  detection  in  the  model 
of  (1)  for  the  case  in  which  the  noise  sequence  is  independent.  General 
results  for  robust  hypothesis  testing  in  a  model  which  Includes  the 
independent-noise  case  of  (1)  have  been  obtained  by  Huber  [2]  and  by  Huber 
and  Strassen  [31  within  a  mini max  risk  (or  error  probability)  formulation 
and  by  the  author  [4]  within  a  maxim! n  distance  formulation.  Martin  and 
Schwartz  [51  have  considered  robust  detection  in  the  model  of  (1)  for  the 
case  in  which  the  independent  noise  process  is  a  mixture -contaminated 
Gaussian  process.  Within  this  context  both  mini max  risk  and  maxi min  local 
power  slope*  are  treated  in  [5].  Kassam  and  Thomas  [8]  have  extended  the 
results  of  [5]  to  solve  die  ’<axi mi n-power- slope  formulation  for  the  case  of 
mixture-contaminated  nonGausslan  noise  processes,  and,  in  an  asymptotic 
formulation,  El-Sawy  and  VandeLinde  [91  have  treated  the  minimax  risk 
problem  for  (1)  with  general  uncertainty  classes  for  the  marginal  distribu¬ 
tion  of  the  independent  noise  sequence.  A  sequential  version  of  (1)  Is 
treated  in  a  similar  context  by  El-Sawy  and  VandeLinde  in  [10]. 


^Recall  that  the  local  power  slope  is  the  criterion  for  designing  locally 
most  powerful  detection  systems  (see  Capon  [61  or  Ferguson  [71). 


Considerably  less  work  has  been  done  on  die  problem  of  robust  detection 
for  the  situation  of  (1)  with  dependent  noise,  although  the  related  problem 
of  nonparametric  (or  distribution- free)  detection  in  (1)  with  dependent 
noise  has  been  considered  in  some  detail  (see,  for  example,  Kanefsky  and 
Thomas  [11],  Gastwirth,  et  si.  [12],  Davisson,  et  al.  [13],  Kaasam  and  Thomas 
[14], and  Sirvancl  and  Wolff  [15]),  as  has  the  statistically  analogous  problem 
of  robust  estimation  of  location  with  dependent  errors  (see  Hoy land  [16], 
Gastwirth  and  Rubin  [17],  Portnoy  [18,19],  and  Koul  [20]).  In  this  paper  we 
consider  robust  detection  in  (1)  for  situations  in  which  the  noise  can  be 
modeled  by  a  weakly  dependent  moving -average  process  as  described  by  (2) 
with  small  |p | .  Since  the  parameter  p  can  be  estimated  straightforwardly  from 
an  observation  of  (x^;  1*1,2,...,*}  (see  Eq.  (53)  of  [1])  we  model  statistical 

uncertainty  in  the  noise  sequence  {N^j  1  “  1,2 . »}  by  assuming  that  the 

p.d.f.  p  of  the  noise-generating  sequence  (Y^;  i  €  Z}  is  not  known  exactly 
but,  rather,  is  known  only  to  be  a  member  of  a  class  9  of  symmetric  probability 
densities.  Note  that,  for  analytical  reasons,  the  criterion  of  asymptotic 
detection  efficiency  (which  was  applied  in  [1]  for  optimum  design)  is  not 
suitable  for  robust  design  in  this  situation  (see,  for  example,  Martin  and 
deMontricher  [21]).  Thus,  we  adopt  a  Neyman- Pearson  design  philosophy  and, 
for  a  particular  detector  cp,  consider  the  probabilities  of  false  alarm  and 
detection,  P^  and  P^,  respectively,  given  for  p  €  5  by 

Pp^lp)  ■  P&P  chooses  HjJHq  is  true  and  ~  p  T  1  (  Z)  (3) 

and 

PD(<p|p)  ■  P[«p  chooses  Is  true  and  Y^  ~  p  V  i  €  Z}  .  (4) 
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Adopting  the  game-theoretic  philosophy  of  robust  design  (as  proposed  by 
Huber  [2])  we  may  define  an  robust  detector  for  this  situation  as 

one  which  solves  the  game 

max  (  inf  P—CP Jp)}  subject  to  sup  P_(cp |p>  <t  a  (5) 

<P€£p€?u  p€? 

where  &  is  an  appropriate  class  of  detectors.  Note  that  (5)  reduces  to  the 
traditional  Neyman-Pearson  design  criterion  if  5  consists  of  a  single  p.d.f.  [7]. 

In  this  paper  we  seek  the  solution  to  (5)  for  the  small-|p|  situation  in 
(1)  and  (2)  by  restricting  &  to  be  the  class  of  M-detectors  as  proposed  by 
El-Sawy  and  VandeLinde  in  [9].  In  so  doing  we  are  able  to  exploit  related 
results  obtained  by  Portnoy  [18,19]  for  the  problem  of  robust  estimation  of 
a  location  parameter  with  dependent  observation  errors  modeled  by  (2)  with 
small  |p|.  The  class  of  detectors  to  be  considered  here  is  defined 
specifically  in  Section  II  and  first-order  (in  p)  approximations  to  their 
asymptotic  false-alarm  and  detection  probabilities  are  developed.  The  structure 
of  M-detectors  that  are  optinum  with  respect  to  these  approximations  is  also 
presented  in  Section  II,  and  it  is  seen  that  the  system  modification  to 
account  for  weak  noise  dependency  in  M-detectors  is  similar  to  that  found 
for  the  detectors  considered  in  [1].  The  first-order  approximations  to  the 
error  probabilities  developed  in  Section  II  suggest  a  first-order  approxima¬ 
tion  to  the  mini max  problem  of  (5),  and  Section  III  considers  the  solution  to 
this  first-order  version  of  (5).  In  particular,  it  is  argued  that  a  small- lp] 
robust  design  for  this  problem  is  the  small-jp|  optimum  design  corresponding 
to  a  least- favorable  noise-generating  p.d.f.  Moreover,  it  is  seen  (as  in 
the  analogous  situation  of  [18])  that  this  least- favorable  noise-generating 
p.d.f.  does  not  depend  on  the  value  of  p  and  thus  is  the  same  as  the  p.d.f. 


that  Is  least  favorable  for  the  Independent  noise  case,  'nils  latter  result 
Is  useful  because  the  corresponding  lndependent-nolse  problem  has  been  solved 
previously  for  a  number  of  noise  uncertainty  models  (see,  for  example, 

[5,8,9]).  In  Section  IV,  we  consider  a  modification  of  the  system  developed 
In  Section  III  because  of  qualitative  disadvantages  of  the  system  of 
Section  III.  It  is  demonstrated  analytically  in  Section  IV  that  this 
modification  is  equivalent,  to  first  order  in  p,  to  the  detector  developed 
in  Section  III,  a  fact  which  supports  the  use  of  the  modification  intuitively 
since  the  modified  system  is  superior  to  the  unmodified  one.  Also  in 
Section  IV,  the  specific  example  of  a  contaminated  Gaussian  noise  model 
(as  proposed  by  Huber  [22])  is  considered  in  some  detail  in  order  to  illustrate 
the  proposed  robust  design  procedure.  In  Section  V  we  return  to  the  general 
situation  to  consider  the  degree  to  which  the  worst-case  performance  of  the 
proposed  robust  detector  outperforms  the  corresponding  independent-noise 
(p  ■  0)  robust  detector.  In  particular,  it  is  shown  that  this  performance 
difference  is  of  first  order  in  p,  a  fact  which  indicates  that  the 
consideration  of  dependence  is  even  more  important  in  the  robust  design 
problem  than  in  the  corresponding  optimum  design  problem  of  [ 1] .  (The 
corresponding  performance  difference  in  [1]  is  of  second  order  in  p.) 

Finally,  Section  VI  includes  some  further  discussion  of  the  results  of  this 
paper  and  of  soma  possible  extensions  of  these  results. 


El-Sawy  and  VandeLinde  [9]  have  solved  the  problem  of  (5)  for  the 
asymptotic  (a  -»  •)  case  with  Independent  noise  (p  -  0)  when  the  class  Jfr 
Is  restricted  to  contain  only  detectors  of  a  specific  structure  known  as 


M-de tec tors .  This  class  of  detectors  is  based  on  a  class  of  location 
estimates  known  as  M-estl mates  introduced  by  Huber  in  [22],  and  certain 
members  of  this  class  derive  their  robustness  properties  from  analogous 
properties  for  robust  estimation.  Robust  M-estimates  of  location  for  the 
weak  dependence  model  of  (2)  with  small  |p|  have  been  considered  by  Portnoy 
in  [18]  and  [19],  and  thus  it  is  reasonable  to  consider  the  related  class  of 
M-detectors  to  seek  a  solution  to  the  analogous  problem  of  (5)  with  |p |  small 
but  nonzero. 

We  therefore  restrict  &  to  contain  only  detectors  of  the  following  form 


1  i  if  ^(x)  >  T 

y  ;  if  (*)  *  t 

0  ;  if  (x)  <  t 


where  8^ (x)  is  a  solution  to  the  equation 


(6) 


Z  t  (x.  -  T)  |  ^  -  0  (7) 

i-1  1  T-0  (x) 


with  f  an  arbitrary  function,  known  as  the  influence  curve  of  8^  (see  also 

Hampel  [23]  and  Huber  [24])  characterizing  the  detector  <p  .  Here  «p  (x) 

f  v 

denotes  the  probability  with  which  we  accept  H^  given  that  x  is  observed, 
and  the  threshold  Y  and  randomization  t  are  chosen  to  yield  desired 
false-alarm  performance.  A  detector  of  the  form  (6)  is  known  as  an 
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M-detector  [9]  and  the  deration  statistic  0^(x)  of  (7)  is  an  M-estimate 
[221.  Note  that  the  M-estlmate  is  a  generalization  of  the  independent- 
noise  maximim- likelihood  estimate  of  location  of  X  which  corresponds  to 
the  particular  choice  of  influence  curve  f(x)  *  d  log[p(x)]/dx  (see,  for 
example,  Silvey  [25]).  Moreover,  in  the  independent  case,  the  M-detector 
corresponding  to  the  maximum- likelihood  M-estimate  is  asymptotically 
equivalent  to  the  likelihood  ratio  test  of  Hq  versus  (see,  for  example. 
Lemma  3  of  [9]).  Thus,  the  restriction  to  detectors  of  the  form  of  (6)  is 
reasonable  for  the  small- |p |  and  large-n  case  in  that  it  does  not  eliminate 
the  optimum  detector  for  any  member  of  7  from  consideration  when  p  *  0.  Note 
further  that  the  class  of  M-estimates  includes  the  sample  mean  (given  by 
f(x)  ■  x),  for  which  the  corresponding  M-detector  is  the  linear  detector, 
as  well  as  the  sample  median  (given  by  f(x)  *  sgn(x)). 

Note  that,  for  many  choices  of  the  influence  curve  f,  (7)  will  sometimes 
have  multiple  solutions;  however,  for  analytical  (and  implementatlonal) 
purposes,  we  would  like  to  specify  a  particular  solution  to  (7).  Thus,  if 

A 

for  a  given  x  there  are  multiple  solutions  to  (7)  we  will  choose  0  (x)  to  be 

-  A  1  n  ’ 

the  solution  closest  to  the  sample  mean  x  *  —  Ex.,  and,  if  there  are  two 

n  i-1  1 

•  A 

solutions  equidistant  from  x,  we  choose  0^ (x)  to  be  the  larger  of  the  two. 
Also,  if  (7)  has  no  solution  for  a  particular  x  we  take  0^ (x)  ■  0.  With  this 

A 

construction  of  0^(x)  we  may  state  the  following  result  which  follows  from 
Theorem  A. 2  of  Portnoy  [18]. 

2  A 

Theorem  1:  Assume  the  model  of  (1)  and  (2).  Suppose  a  ■  Var(Y^)  <  •,  f  is 
continuous  and  bounded,  E[f(N^  +  0)}  is  strictly  increasing  in  a  neighborhood 

of  0*0,  and  E (N^) )  ■  0.  Then  0^(X)  converges  in  probability  to  0  under 

and  to  0  under  Hq  as  n 


.  • 


/'■  A  A  .  •  V-  A  V*  A  .  * 


Proof;  For  the  model  of  (1)  and  (2)  with  a  <  •  it  follows  from  the 


Chebychev  Inequality  and  from  Leoma  3  of  Billingsley  [26  ,  p.  172]  that 
1  n 

—EX.  converges  in  probability  to  9  under  H,  and  in  probability  to  0 
“  i-1  1  1 

under  Hq  as  n  ■».  This  and  the  fact  that  {N^;  1  -  1,2,...,*}  is 

2-dependent  (i.e.,  and  Nj  are  independent  if  | i-j |  >  2)  imply  Theorem  1 

2 

via  Theorem  A. 2  of  [18]. 

Thus,  under  the  mild  conditions  of  Theorem  1,  we  see  that  the  M-detector 

<P  of  (6)  provides  a  consistent  test  of  Hq  versus  provided  that  the 

threshold  t  is  between  0  and  8.  That  is,  with  t  €  (0,6)  has  the  property 

that  false-alarm  and  detection  probabilities*?^  and  P^,  converge  to  0  and  1, 

respectively,  as  the  number  of  samples  n  approaches  •  .  Consistency  is,  of 

course,  the  very  least  that  we  should  require  of  a  detection  procedure; 

thus  in  order  to  optimize  over  the  class  3,  it  is  of  further  interest  to 

approximate  the  large-n  performance  of  the  detectors  of  this  form  for  the 

weak  dependence  model  of  (2).  For  this  purpose  we  may  state  the  following 

result  which  follows  straightforwardly  from  Theorems  2.1  and  A. 4  of  [18]. 

Theorem  2;  Assume  the  hypothesis  of  Theorem  1.  Suppose  in  addition  that 

t  is  differentiable  except  as  a  closed  set  D  of  Lebesgue  measure  0, 

that  is  uniformly  continuous  off  of  D,  that  0  £  D,  that  E{t'(N^)} 

and  B ' (Y^ ) }  are  positive,  and  that  the  characteristic  function  of 

satisfies  J*  u  |$Y(u)Jdu  <  «.  Then,  with  ^  -  t^+6  for  i-l,2,...,«,  the 

quantity  n  ^(^(X)  -9)  converges  in  distribution  to  a  Gaussian  random 

2  3 

variable  with  mean  zero  and  variance  a  (f,p;p)  given  by 


2 

Note  that  the  sample  mean  can  be  replaced  by  any  other  consistent  estimate  of 
8  in  this  analysis  (see  [18]),  in  which  case  the  condition  a*  <  «  might  be 
relaxed  in  Theorem  1. 

^As  in  [1],  by  0(p2)  we  mean  lim  (o(p2)/p2j  <  •. 


o  ($,pip)  -  V(^,p;p)  +  0(p  )  (8 

where 

0?,)}  ECY.tor,)} 

v(f,p;p)  - *— tt  +  4p — * — —  (9 

CeU'CYj)})  eC^'Ot^} 

with  expec cations  computed  under  Che  assumption  Chet  ~  p. 

Discussion:  The  validity  of  Theorem  2  relies  on  two  basic  results:  a 
central  limit  theorem  (Theorem  A. 4  of  [18])  yielding  the  asymptotic 

A 

normality  of  0^(X)  and  an  approximation  theorem  (Theorem  2.1  of  [18]) 

yielding  (8) .  The  proof  of  the  central-limit  part  of  this  theorem  relies  in 

part  on  a  Berry-Esseen  type  theorem  for  m-dependent  random  variables  due  to 

.v 

Stein  [27,  Corollary  3.1]  which  gives  an  error  bound  of  0(n  )  for  the 

A 

normal  approximation  to  the  distribution  of  n  (8^ (X)  -8).  Thus,  under 
the  conditions  of  Theorem  2,  we  may  write  the  false-alarm  and  detection 
probabilities  of  cp^  as 

PyGpj|p)  “  1  -$(n^i7  [V(*,pjp)  +0(p2)]*)  +  0(n"*)  (1 

and 

PD%lp)  "  1-#(»*(t -0)/[V(t,p;p)+O(p2)]*)+O(n“*),  (1 

respectively,  where  i  is  the  standard  normal  distribution  function 


»<*>  -  -i  I 

V2tt  -• 


*  -t2/2 
e  c  dt. 


In  view  of  (10)  and  (11)  we  adopt,  for  analytical  purposes,  the  following 
large-sample  week-dependence  approximations  to  P_  and  P_  , 


10 


PF(«pjJP)  “  1  -*(n^T/  (V(*,p;p)]*) 

(13) 

$DCspj»P)  ■  l-«(n^(r -8)/[V(^,p;p)l^)  . 

(1A) 

Note  that,  for  a  fixed  noise-generating  density  p  €  7  and  threshold 
t  €  (0,8),  the  error  probabilities  of  (13)  and  (14)  can  be  optimized 
simultaneously  by  minimizing  the  functional  V(f,p;p)  over  an  appropriate  class 

of  influence  curves.  Note  also  that,  for  fixed  p,  we  have 

V(*,p;p)  -  [e(tJp)]"1  (15) 


where  e(f ;p)  is  the  first-order  approximation  developed  in  Part  I  of  this 
study  (see  Lemma  1  of  [1])  for  the  efficacy  or  differential  signal-to-nolse 
ratio  of  a  detector  for  (1)  and  (2)  based  on  comparing  the  detection  statistic 

E  ♦(X1)  (16) 

i-1  X 

to  a  threshold.  Thus,  the  criterion  of  maximum  e(f ;p)  which  was  developed 
in  Part  I  in  the  context  of  efficient  detection  for  (1)  and  (2)  with  the 
structure  of  (16)  is  equally  valid  for  approximately  optinum  detection 
in  the  situation  with  the  structure  of  (7).  In  particular,  by  applying 
Theorem  1  of  [11  we  have  that,  within  mild  regularity  on  p,  the  problem 
m|n  V(f,p;p)  is  solved  for  fixed  p  by  the  influence  curve 

♦  (x)  -  -p*  (x)/p(x)  -  2pl(p)x/(l  +  2p)  (17) 


where  p'  denotes  the  derivative  of  p  and  where  I(p)  is  Fisher's  information 
number  for  location  of  p  defined  by 

I(P)  *  /  [(p'(x))2/p(x)]dx  .  (18) 


Mote,  however,  that  the  influence  curve  defined  in  (17)  does  not  satisfy  the 

hypotheses  of  Theorems  1  and  2  above  since  this  function  is  not  bounded. 

* 

However,  as  is  shown  in  Section  IV  below,  the  function  of  (17)  can  be 
approximated  by  a  bounded  influence  curve  that  yields  essentially  the 
same  performance  for  small  |p|.  This  issue  is  discussed  further  in 


Section  IV 


III.  Robust  Detection  In  Weakly  Dependent  Noise 

The  analysis  of  the  above  section  Indicates  that  the  quantities  of 
(13)  and  (14)  can  be  used  to  approximate  the  probabilities  of  false  alarm 
and  detection,  respectively,  of  M-de tec tors  operating  in  the  presence  of  the 
noise  process  of  (2)  with  small  |p|.  Thus,  In  order  to  design  M-detectors 
that  are  robust  in  the  presence  of  weak  dependence,  we  may  consider  the 
"“»•*<  "*<n  formulation  defined  by  (5)  as  applied  to  the  error-probability 
approximations  of  (13)  and  (14). 

To  seek  solutions  to  this  problem  we  first  consider  the  alternate 
mini max  problem 

min  msxV(t,p;p)  (19) 

♦  €  T  P  6  5  ‘ 

where  the  functional  V  is  from  (9)  and  where  Y  Is  an  appropriate  class  of 
Influence  curves.  We  note  that  a  saddle-point  solution  to  (19)  will  be  a 
pair  (tR»PR)  6  Y  x  5  satisfying 

«•*  V(t_,p;p)  -  V(YR,p_;p)  -  min  v(*,p  ;p),  (20) 

p  €  y  *  rk  Y^Y  R 

and  the  existence  of  such  a  pair  is  equivalent  to  the  validity  of  the 
odnimax  property  (see,  for  example,  Barbu  and  Precupanu  [28]) 

min  max  V(f,p;p)  -  max  min  V(f,p;p)  .  (21) 

Y  €  Y  p  €  y  p  €  5  f  €  Y 

Note  that,  if  (tR»PR)  €  Y  x  y  satisfies  (20),  then  it  follows  from  (13) 
and  (14)  that  for  any  threshold  t  €  (0,9)  we  have 


— »  -*  .-*  j*._ 


fp(9*  IP)  -  fF%  ;pR> 


p€5‘  fR 


nln  wax  ?  (tp  ;p>  -  P  <jp  ;p  ) 
f  €  ?  p  €  5  D  *  D  *R  R 

-  min  P  (cp  ;p) 
P  €  y  fR 


For  a  particular  choice  a  of  felse~elerm  probability,  the  threshold  t  given  by 

R 

tr  -  n'%[V(tRpR;p)]%  *“1(l-a)  (24) 

will  yield  a  value  of  P  («  »p_)  *  a  .  Thus,  in  view  of  (22)  and  (23),  if  t_ 

V  h  R  R 

of  (24)  is  between  0  and  6  then  the  pair  (tR»PR)  is  *  saddle-point  solution 

A  ^ 

to  (5)  with  PQ  and  Pp  replaced  by  P^  and  Pp,  respectively.  Note  that  the 
condition  tr  >  0  is  satisfied  if  a  <  %,  which  is  the  only  range  of  interest 
for  a.  However,  the  condition  tr  <  0  places  a  lower  bound  on  the  signal 
strength  6,  relative  to  a  and  n,  for  which  mlnimaxity  can  be  achieved  with 

A  * 

PD  and  Pp.  Further  discussion  of  this  point  is  included  in  Section  VI 
below. 

It  is  noteworthy  that  a  pair  solving  (20)  satisfies  (via  (10)  and  (11)) 

PF(<Pj  ;p)  £  Pp(<Pj  JPR)  +  0(p2)  +  0(n‘*)  (25) 

R  R 

and 

PD%  JP)  2  PD%  *Pr)  +  0(p2)  +  0(n'*)  (26) 

R  R 

for  every  p  €  ?#  However,  corresponding  0(p  )  end  0(n  *)  statements 


V  ■/  VVv  V’/’nWV'.'V*/ 


wvuw«4iUiU5  auu  9r /  ****  VT*  )r/  vwmwi.  »— «  »  »**•*»■»  •**» 

p  €  7  F  »r  P  €  7  D  *r 

0(p2)  and  0(n”^)  term*  in  (8),  (10),  and  (11)  are  uniform  on  7.  Nevertheless, 
because  of  (25)  and  (26),  a  density  satisfying  (20)  can  be  considered  a 
leas t- favorable  member  of  7  for  the  detection  problem  of  interest. 

In  view  of  the  above  comments,  we  see  that  a  solution  tR  to  (19) 
yields  an  M-detector  with  desirable  week-dependence  large-setup le  robustness 
in  performance  over  the  class  7.  The  problem  of  (19)  has  been  studied 
in  [18]  in  the  context  of  robust  M-estimation  of  location.  Note  that,  if 
(9r,Pr)  satisfies  (20)  and  if  pR  and  the  members  of  Y  satisfy  the 
hypothesis  of  Theorem  1  of  [1],  then  Theorem  1  of  [1]  Indicates  that  fR 
must  be  given  (up  to  a  scale  factor)  by 

♦r(k)  -  t0(*)  -V  ;  x  €  *  (27) 

where  1Q  -  -p£/pR  and  -  2pI(pR)/(l  +  2p),  provided  this  function  (27)  is  a 
member  of  Y.  (Recall  that  I(p)  denotes  Fisher's  information  number  given 
by  (18).)  Using  Eq.  (36)  of  [1]  we  have  (see  also  Eq.  (3.8)  of  [18])  for 
fixed  p  €  7 

min  V(*»p;p)  -  (l  +  4p)[I(p)]"1  +  0(p2) 

-  (l  +  4p)ndn  V(t,p;0)+0(p2)  .  (28) 

Thus,  provided  Y  contains  the  appropriate  Influence  curves,  we  may  argue 

2 

(as  in  [18])  that  to  0(p  )  the  density  p_  solving  max  min  V(f,p;p) 

R  p  €  7  f  €  Y 


does  not  depend  on  the  value  of  p.  Theorem  1.1  of  [18]  gives  e  more  precise 
reinforcement  of  this  ergument.  Noting  from  (28)  thet 

«in  V(t;p;0)  -  IKp)]"1  ,  (29 

e  saddle  point  solution  to  (19)  thus  can  be  sought  by  first  choosing 
pj  €  5  to  solve 

sdn  I(p)  (30 

p  €  y 

and  then  choosing  from  (27).  Note  that  the  resulting  robust  Influence 
curve  is  a  linearly  corrected  version  of  the  Influence  cruve  for  robust 
M-detection  in  independent  noise  (i.e.,  $q)  as  derived  in  [9].  This  solution 
is  thus  completely  analogous  to  the  corresponding  result  for  optimum 
detection  with  known  p  derived  in  [1]. 

The  existence  and  uniqueness  of  solutions  to  (30)  and  their 
relationship  to  solutions  to  (19)  with  p  ■  0  have  been  studied  in  detail  by 
Huber  in  [22].  In  particular,  it  follows  from  Theorem  2  of  [22]  that  if  5 
is  convex  and  I(p)  <  •  for  all  p  €  y,  then  a  density  pR  solving  (30)  and 
its  p  *  0  optimum  Influence  curve  m  “P r/Pr  is  *  saddle  point  solution 
to  (19)  with  p  ■  0  if  6  T.  Furthermore,  Theorem  4  of  [22]  implies  that, 
if  y  is  also  vaguely  compact,  then  there  is  a  unique  member  of  y  solving 
(30).  Explicit  solutions  to  (30)  for  several  density  classes  of  Interest 
and  other  aspects  of  the  problem  of  (30)  are  found  in  Huber  ([22]  and  [29]) 
and  in  Sullivan,  et  al.  [30]. 


IV.  A  Modification  of  the  Mud. ala  Solution 

As  noted  above,  the  analysis  of  Section  IV  indicates  that  a  robust 

M-detector  for  |p|  j*  0  has  an  influence  curve  that  is  a  linearly  corrected 

version  of  the  influence  curve  of  the  p  ■  0  robust  M-detec tor  as  studied  by 

El-Sawy  and  VandaLinde  in  [9].  This  structure  is  undesirable  for  two  reasons. 

First,  as  noted  in  Section  II t  an  Influence  curve  with  a  linear  component  is 

not  bounded  and  thus  does  not  satisfy  the  conditions  needed  for  the  validity 

of  Theorems  1  and  2.  This  is  only  an  analytical  disadvantage  which  can  be 

surmounted  without  too  much  difficulty.  However,  a  more  important  objection 

to  this  detector  is  that  the  unboundedness  of  the  Influence  curve  violates 

basic  Intuitive  principles  of  how  robustness  is  achieved  in  a  detector. 

In  particular,  most  robust  detectors  provide  a  means  for  limiting  the  effects 

of  extraordinarily  large  observations  (outliers)  which,  if  not  accounted  for, 

tend  to  destroy  detection  performance  (see,  for  example  [5]).  This  objection 

was  also  raised  in  [18]  where,  for  the  particular  case  in  which  9  represents 

contaminated  Gaussian  noise,  a  truncated  or  lightly  limited  version  of  is 

shown  to  produce  an  M-estimate  which  differs  in  worst-case  performance  from 
2 

by  only  0(p  ).  In  this  section  we  consider  a  similar  modification  of  the 
robust  M-detector  developed  in  Section  III  in  a  slightly  more  general  context. 

Since  the  robust  Influence  curve  of  (27)  is  objectionable  because  of 
its  unboundedness,  it  is  reasonable  (as  suggested  in  [18])  to  introduce 
light  limiting  into  this  structure  to  produce  a  bounded  approximation  to 
It  is  usually  the  case  that  the  independent-noise  robust  influence  curve 
is  bounded  (see  [9]  and  [22]);  so  it  is  usually  the  linear  correction  term, 
-K_x,  that  produces  the  unboundedness  of  f  .  Thus  a  reasonable  modification 


of  $R  is  to  replace  the  term  KgX  with  a  lightly  limited  version  (x) 
where  l (x)  is  a  light  limiter  defined  by 


/  x 


i(x)  -  ^ 


J  if  |x|  £  L 
l  L  sgn(x)  ;  if  |x|  >  L 


with  L  a  positive  constant.  Note  that  Chebychev's  Inequality  lop lies  that 

the  probability  that  an  individual  observation  exceeds  the  limiting  point  L 

-2 

is  bounded  by  an  upper  bound  proportional  to  L  .  Thus,  by  choosing  L 

sufficiently  large,  we  should  be  able  to  make  the  effects  of  replacing  x 

by  l(x)  in  fg  negligible.  In  particular,  if  we  choose  L  to  be  0(p~^) ,  then 

2 

the  effects  of  this  replacement  should  be  0(p  ).  With  this  motivation,  we 
thus  propose  replacing  of  (27)  with  the  modified  influence  curve 
defined  by 


♦Moo 


'  tR(x)  ;  if  |x|  £  l/kp 

<  <3 

t0(x)  -  Kgk” 1#gn(x)  ;  if  |x|  >  l/kp 


where  kp  -  0(p)  and,  as  before,  f q  -  -pR/pR.  For  this  structure  we  then 
have  the  following  result. 

Theorem  3:  Suppose  7  is  such  that  (30)  has  a  solution  pR  with  fg  -  -p^/pR 
satisfying  the  conditions  of  Theorems  1  and  2.  Then,  for  each  p  €  7  such 
that  EtY2}  <  *,  we  have 

V(tM,p;p)  -  V(f  ,p;p)  +  0(p2)  . 


C 
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o 

Furthermore,  if  e(Y^}  S  B  <  •  for  all  p  €  5,  then 


»«P  V(*M,p;p)  -  sup  V(*  ,pjp)+0(p  )  , 


p  €  9 


p  €  y 


provided  either  p  ^  0  or  p  <  0  and  (p|  is  sufficiently  small. 

A  proof  of  Theorem  3  is  included  in  the  appendix.  As  suggested  above, 

this  proof  relies  upon  (among  other  things)  Chebychev-type  bounds  on 

probabilities  that  the  magnitudes  of  the  observations  exceed  lc”*-.  We  see 

that,  under  the  conditions  of  this  theorem,  the  truncated  version  of 

yields  a  detector  whose  wors  t-case  performance  is  essentially  equivalent 

to  that  of  <p  .  Note  that  if  the  conditions  of  Theorem  3  are  not  satisfied 
fR  2 

(i.e.,  if  there  is  a  p  €  5  such  that  e{Y^}  ■  •)  then  the  M-detector  based 

on  f  will  have  very  poor  wors t-case  performance  relative  to  that  of  <p  . 

R  fM 

Thus  we  may  conclude  generally  that  «p.  is  preferable  to  o  from  both 

practical  and  analytical  viewpoints. 

To  illustrate  the  design  of  a  robust  M-detector  and  its  modification  as 

suggested  by  (32),  we  consider  a  specific  example  treated  previously  in 

various  contexts  of  robust  design  by  Huber  [22],  Martin  and  Schwartz  [5], 

and  Portnoy  [18].  In  particular,  we  consider  the  class  of  contaminated 

Gaussian  densities  defined  by 

*0  -  Cp|p  -  (l-o?  +  *  h;  h  €  X)  (32 

where  5(x)  -  (2Tt)"exp{-x  /2}  is  the  standard  Gaussian  density,  «  is  a  fixed 
number  between  0  and  1,  and  X  is  a  wide  class  of  synnstric  pdf's.  Note  that 
5q  thus  defined  can  be  considered  to  be  an  uncertainty  neighborhood  of  a 
nominal  Gaussian  model  with  a  degree  «  of  uncertainty  in  this  model.  The 


.  V.  V 
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density  minimizing  Fisher's  Information  I(p)  ovar  Is  givan  by  Hubar  in 
[22]  and  has  tha  vail -known  form 


PR(x)  - 


a  -  c)?(x) 


;  M  *  k 


(36) 


(1  -  *)?(k)axp(-k|  |x j  -  k|}  ;  |x|  >  k 
where  k  Is  tha  unique  positive  solution  to  tha  equation 


2f(k)  -1  +  2§<k)/k  -  (1-s) 


-1 


(37) 


The  p  ■  0  robust  influence  curve  it  thus  given  by 


♦  n(x) 


i  |*|  *  k 


(38) 


k  sgn(x)  j  | x |  >  k 
and  the  |p|  +  0  robust  influence  curve  is  (from  (27)) 


♦R(x) 


(1-VJ 


;  (x|  £  k 


(39) 


'  k  *ga(x)  -  ^x  ;  |x|  >  k  , 


which,  for  tha  case  p  >  0,  Increases  linearly  in  [-k,k]  and  decreases 


linearly  in  [-k,k]c  .  Note  that  tha  value  of  I(PR)  ia  givan  for  this 


case  by 


I(PR>  -  (l-«)(2*(k)-l)  , 


(40) 


which,  of  course,  must  decrease  monotonically  with  «.  Using  (37)  and  (40) 
tha  values  of  the  parameters  k  and  (recall  that  *  2pI(pR)/(l+ 2p)) 
can  be  computed  for  given  c  and  p.  For  example  the  case  s  »  0.1  and  p  *  0.1 


:»>«»< 


yields  k  *  1.14  and  •  .115.  A  reasonable  bounded  modification  for  the 
Influence  curve  of  (39)  with  p  >  0  Is  that  suggested  in  [18],  namely 


(i-V 


s  1*1  Sk 


♦Mco 


k  sgn(x)  -  Kjk  ;  k  <  |x|  S  k‘ 
0  i  |x|  >  k'1 


where  kp  *  K^/k  which  is  0(p)  since  is  0(p).  Figure  1  illustrates  this 
function  for  the  case  e  ■  0.1  and  p  ”  0.1.  Further  discussion  of  this  and 
related  examples  is  Included  in  Sections  V  and  VI  below. 
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V.  Performance  of  the  Proposed  Robust  System  Relative  to  the 
Independent-Noise  Robust  System 

In  Section  III  we  argued  that  the  least- favorable  noise-generating  density 

pR  is  independent  of  the  value  of  the  dependence  parameter  p.  Thus,  for 

each  value  of  p ,  the  robust  influence  curve  is  the  optimum  influence 

curve  (i.e. ,  the  solution  to  min  V(t, PB»P)>  corresponding  to  the  fixed  density 

♦  R 

pR.  It  follows  from  the  analysis  in  Part  I  of  this  study  (in  particular,  from 

Theorem  2  of  [1])  that,  within  regularity  conditions,  the  quantities  v(tR»PR»P) 

2 

and  v(tg»PR»P)  differ  only  by  0(p  )  terms  where,  as  before,  -  “PR/pR* 

Since  is  the  Influence  curve  one  would  use  for  robustness  if  there  were  no 

dependence,  a  question  arises  as  to  whether  the  worst-case  performance  of 

2 

{p  over  7  might  be  only  0(p  )  different  from  that  of  cp.  .  That  this  is  not 

fo  h 

generally  the  case  is  shown  in  this  section;  in  particular,  we  demonstrate  that 
this  difference  in  worst-case  performance  is  actually  0(p)  for  most  uncertainty 
models  of  Interest. 

To  consider  the  worst-case  performance  of  «p  relative  to  the  worst-case 

fR 

performance  of  <p  for  fixed  p  we  first  give  the  following  result. 

*0 

Lemma  1:  Suppose  pR  €  7  solves  (30)  with  0  <  I(PR)  <  ••  Suppose  further  that 

2 

there  are  numbers  b  and  B  such  that  0  <  b  £  ECtR(Y^)}  ^  B  <•, 

EtrJ}  £  B,  and  b  £  E{|q}  £  B,  for  all  p  6  7  where  fg  -  -pR/pR.  Then,  if 
p  S  0  or  if  p  >  0  and  is  sufficiently  small,  we  have 


v(tR,p;p)  2 

- 1  +  V  +  °<>  >  ■ 


(42) 


*It  should  be  noted  here  that  V(tg,pR;0)  is  the  worst-case  value  of  V($g,p;0) 
over  p  €  7,  but  it  is  not  necessarily  true  that  V(tg,pR;p)  is  the  worst-case 
value  of  V(tg,p;p)  over  7  for  p  0.  In  fact,  this  lattar  situation  is  not 
usually  the  case,  as  follows  from  Lemma  1  and  Theorem  4. 
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where 

4l(pR)V(t0,p;-%) 

°P  “  EftjOr^JVftQ.piO)  »  (43) 

2 

and  where  0(p  )  is  uniform  over  7. 

The  proof  of  Leona  1  is  straightforward  and  will  be  omitted. 

This  result  shows  that,  although  v(fR»PR»P)  and  V(t0»P15P)  differ  by  only 
2 

0(p  )  terms,  V(fR,p;p)  and  V(tQ,p;p)  nay  possibly  differ  by  0(p)  terms  if  p*pR. 
(Note:  it  follows  straightforwardly  from  (9)  that  V(tg,pR;-fc)  -  0.)  It 
follows  from  Lenina  1  that,  for  fixed  p,  we  have 

sup  V(*Q,p;p)  -  sup  V(t_,p;p)  +  C|p|  +  0(p2)  (44) 

p€5  p€£ 

where  C  -  sup  C  if  p  >  0  and  C  -  |  inf  C  |  if  p  <  0.  He  note,  in  particular, 
p€*p  p€*p 

that  (44)  can  not  hold  for  any  C  <  0  since  we  must  have 

sup  V(tQ,p;p)  a  V(fQ,pR;p)  -  V<*R»PR;P)  +  0(p2) 
p  €  7 

-  sup  V(*R,p;p)  +  0(p2)  (45) 

P  €  S'  R 

where  the  first  equality  in  (45)  follows  from  Theorem  2  of  [1].  Since  (45) 
is  valid  for  both  positive  and  negative  p,  (45)  and  (42)  imply  that  either 
V (f q,p j "%)  takes  on  both  negative  and  positive  values  over  7  or  V(t0,p;-fc) 
is  Identically  zero  on  7.  Thus,  for  every  p  there  is  a  C  *  0  such  that 
(44)  holds,  and  C  ■  0  if,  and  only  if,  V(tQ,p;-fc)  *  0  for  all  p  €  7.  He 
note  again  that  C  depends  only  on  the  algebraic  sign  of  p.  Some  conditions 
under  which  C  ^  0  are  summarized  in  the  following  result. 


Theorem  4:  Suppose  7  is  such  that  the  following  three  conditions  hold: 
(1)  There  is  a  set  flc  n  and  a  constant  K  >  0  such  that  x  €  0 
implies  fn(x)  ■  K  sgn(x). 


%  •  ^  .  *  s  '  «  «  f  '  •  *■  v  '  %  "*  •  m  %  .  * 


(ii)  There  exist  pdf's  and  pj  5  such  that  p^  ■  Pj  *  pR 

m  m  m 

on  fic  and  J*  Jyjp^CyJdy  <  J*  |y|pR(y)dy  <  J*  |y|p2(y)dy. 

(ill)  0  <  I(p  )  <•  and  lim  *0(y)p'(y)-0  for  p-pR,  P-Pj^  and  P"P2* 

|y|  ^  • 

Then  V^^p^-i)  >  0  and  V($0,p2;-%)  <  0,  and  hence  the  constant  C  of  (44) 
is  positive. 

Proof:  See  the  appendix. 

Thus  we  see  that,  under  the  conditions  of  Lenina  1  and  Theorem  4,  the 
improvement  in  worst-case  performance  by  using  fR  rather  than  is  of 
first  order  in  jp).  Theorem  3  implies  that  of  (32)  also  yields  this  n(p) 
improvement.  Note  that  the  conditions  of  Theorem  4  are  satisfied  by  most 
of  the  usual  models  for  distributional  uncertainty.  For  example,  the 
contamLnated-Gaussian  class  treated  in  Section  IV  satisfies  Conditions  (1) 
through  (ill)  as  is  easily  seen  from  (35),  (36),  (38),  and  (40).  Other  classes 
that  satisfy  these  conditions  include  contaminated -mixture  classes  with 
nominal  models  other  than  Gaussian  (see  Huber  [22]  and  Kassam  and  Thomas  [8]), 
p-point  classes  as  considered  by  El-Sawy  and  VandeLinde  [9,10],  and  the 
class  of  densities  whose  cumulative  probability  distribution  functions  differ 
in  sup-norm  from  the  standard  Gaussian  by  no  more  than  some  prescribed  amount 
(see  [22]).  Thus,  we  may  conclude  that  fM  is  generally  preferable  to 
to  the  extent  that  0(p)  terms  are  appreciable  (i.e.,  to  the  extent  that  the 
model  of  (2)  is  of  Interest). 
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VI.  Summary  «nd  Discussion 

In  this  paper  we  have  considered  the  problem  of  designing  robust  systems 

for  die  detection  of  signals  in  weakly  dependent  noise.  To  find  solutions 

to  this  problem  we  have  considered  the  class  of  M-de tec tors  that  was  proposed 

by  El-Sawy  and  VandeLinde  in  [91  for  robustness  in  the  corresponding 

independent-noise  case.  For  this  class  of  detectors  it  was  seen  that  the 

robust  estimation  analysis  of  Portnoy  [18]  is  applicable  to  the  design  of 

robust  structures  for  the  weakly  dependent  moving-average  model  of  noise 

dependence.  In  particular,  it  was  seen  that  an  or- level  robust  M-detector  cp 

*R 

for  a  class  7  of  noise-generating  p.d.f.'s  can  be  sought  by  choosing  pR  to 
minimize  Fisher's  Information  I(p)  over  7  and  then  taking  from  (27).  The 
detection  threshold  rR  is  chosen  by  (24)  and  must  lie  between  0  and  8  for 
approximate  ndnimaxity.  Since  is  usually  not  bounded  it  is  intuitively 
more  reasonable  to  modify  by  truncating  the  linear  term  at  points  +  k”1 

where  k  is  0(p)  as  in  (32).  The  resulting  M-detector  cp.  is  equivalent  to 

2  M 
cp  to  0(p  )  under  the  conditions  of  Theorem  3.  The  worst-case  performance 

fR 

of  both  of  these  detectors  is  better  by  0(p)  than  that  of  the  p  -  0  robust 

M-detector  cp  under  the  conditions  of  Lessna  1  and  Theorem  4.  As  is  the  case 
*0 

with  results  of  Part  I  of  this  study  [1],  the  results  of  this  paper  can  also 
be  extended  straightforwardly  to  moving  averages  of  higher  order  than  (2) 
by  applying  the  results  of  Portnoy  [19]  for  M-estimation  in  such  models. 
However,  the  basic  structure  and  performance  of  the  robust  M-detector  are 
unchanged  by  this  generalization. 

In  general,  to  implement  the  robust  detector  cp  one  mist  first  compute 
8  (x)  from  (7)  and  then  compare  this  value  to  a  threshold.  However,  note 


that,  if  tj£x)  i*  s  strictly  increasing  function  of  x, 
strictly  decreasing  function  of  T;  and,  in  this  case, 
as  follows  (sea  [9]): 


than  Z  t  (x.  -T)  is  a 
i-1  *  1 

<p  can  be  imp  lamented 


71  ;  if  Z  ♦«(x1-T)<0 

i-1  *  T 


9*  <*)  -<  Y  i  if  2  ♦»(*!-*-> 
*R  1  i-1  *  1  T 


0  ;  if 


Z  t_(x.-T)>0 
i-1  R  T 


(46) 


where  Tt  and  y  are  chosen  to  give  desired  false-alarm  performance,  the 
structure  of  (46)  is  simpler  to  implement  than  is  (6)  since  it  is  not  necessary 
to  solve  (7)  to  perform  the  test  of  (46).  However,  it  is  not  always  the 
case  that  is  increasing,  although  it  will  be  increasing  if 

df  Q(x)/dx  >  for  all  x  €  K  .  (47) 

If  pR  is  strongly  unimodal  (l.e.,  -log(pR)  is  convex)  then  (47)  holds  for  all 
p  <  0;  however,  for  p  >  0,  (47)  does  not  hold  for  many  practical  cases 
even  when  pR  is  strongly  unimodal  because  of  the  redescending  nature  of 
(such  as  in  the  contaminated -Gaussian  example  of  Section  IV) .  If  is  not 
strictly  Increasing,  then  (46)  cannot  be  used  and  (7)  mist  be  solved; 
however,  efficient  iterative  techniques  for  solving  (7)  have  been  developed 
(see,  for  example,  Collins  [31]). 

The  robustness  of  the  proposed  M-detector  is  restricted  to  situations 
for  which  the  threshold  tr  of  (24)  is  between  0  and  6.  As  noted  above,  the 
trivial  condition  cr  <  %  is  sufficient  for  tr  to  be  positive;  however,  the 


condition  that  tr  be  less  than  9  places  a  lower  bound  on  9,  for  fixed  a  and  n. 


k 


for  which  9  Is  s  mud. min  solution.  If  9  is  smaller  then  Is  necessery  the 
fR 

alternate  approach  of  local  robustness  (see,  for  example,  Kassam  and  Thomas 

[8])  can  possibly  be  applied.  However,  the  redescending  nature  of  the 

solution  to  m^n  V(f,p;p)  for  p  >  0  may  cause  problems  in  the  local  forma* 

latlon  (see  elso  Kassam,  et  al.  [32]). 

Before  concluding,  two  comments  concerning  the  overall  optimality  of 

the  proposed  robust  detector  are  in  order.  First,  since  we  have  considered 

only  M-de tec tors,  a  question  arises  as  to  the  performance  of  the  proposed 

detector  relative  to  other  detectors  that  are  not  of  this  form.  In  answer 

to  this  question  we  note  that  it  has  been  demonstrated  in  [18]  (Theorem  1.1) 

that,  within  regularity,  the  optimum  M-es tlmate  of  6  in  (1)  has  variance 
2 

0(p  )  close  to  the  theoretical  minimum  possible  veriance  for  asymptotically 

Gaussian  unbiased  estimates  of  9.  Thus,  among  threshold  detectors  besed  on 

estimates  of  9,  the  class  of  M-de tec tors  are  capable  of  achieving  overall 

2 

optimum  performance  to  0(p  ).  As  a  second  comment  we  note  that  higher  order 

* 

(in  p)  expressions  for  the  asymptotic  variance  of  0^(X)  are  available  (see 


[18,  p.  39]  and  [19, 


2.1]);  thus  the  question  of  whether  or  not  a 


corresponding  higher  order  optimum  Influence  curve  is  possible  arises. 

However,  that  no  such  M-estlmate  exists  follows  from  Theorem  2.1  of  [19] 

which  states  that,  for  fixed  p  and  within  mild  assumptions,  no  M-es tlmate 

2 

depending  only  on  p  can  achieve  variance  closer  than  0(p  )  to  the  theoretical 
minimum  variance .  This  implies  a  similar  statement  for  M-de tec tors . 


iwwvwvvr* 
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Appendix 

A.  A  Proof  of  Theorem  3 


Define  Z  ■  msxf-k^2,  min{Y^,kp2}}.  We  heve 

‘  EC*r<y1»  +  W  -  ♦ROt1»fRCt1)l 
+  EtW„«i>  -♦Rcr1)|2} 

-  E1»J<*1>1  +  ZK^EfCZ  .  tjHoCtj)} 

-  zj[  E{(Z  -  Y^Yj)  +  k£  E(  |Z  -  Yjl2)  .  (Al) 

Note  that  |Z  -  YjJ  3S  |YjJ  so  that  the  last  two  terns  in  the  right-hand  side 

2  2 

of  (Al)  are  each  bounded  In  magnitude  by  E[Yj}.  Concerning  the  second 
tern  in  the  right-hand  side  of  (Al)  we  have  (applying  the  Schwarz  and 
Chebychev  Inequalities) 

|EC(Z  -  Y1)tQ(Y1)}|2S  E{(Z  -  vSJofiWtlZ  -  Yj  >  0} 

S  E(yJ}  sup  t^OOPClYj  >  k*1} 

y  €  *  p 

5  E2(Y2}  sup  tgCy)^  • 

r  €  *  p 

Thus,  since  k  and  K_  are  0 (p ) ,  we  have 
P  R 

eC^)}  "  EC^<Y1>5  ♦  0(p2)  .  (A2) 

Similarly,  we  have 

*(♦;«!>}  -  Et^tYj)]  +  KrP[|Y1|  >  k’1} 


(A3) 


•  BCVr(T1»  +  "  Z>1 


-  *(*!♦*<*!>!  +  0 (pZ) 


Equation*  <A2)  through  (AA)  imply  (33).  In  each  of  (A2),  (A3),  and  (AA) 

2 

the  higher  order  terms  In  p  are  uniform  over  7  if  E(Yj}  has  a  uniform  bound. 

2 

Thus*  in  this  case,  the  0(p  )  in  (33)  is  uniform  if  (A3)  is  nonzero  over  7, 

2 

a  condition  which  occurs  if  inf  '  (Y. )}  >  -K_B  k  .  The  rest  of  Theorem  3 

P  €  5  R  1  *  p 

follows . 

B.  A  Proof  of  Theorem  A 

The  sign  of  V(tn,p;-%)  is  the  same  as  the  sign  of 


J*  *0  P"(J*  *6  p)<T  Wo(y)p(y)dy)*  ( 

Writing  p  *  pR  +  (p  -  p  ) ,  we  see  that  the  quantity  of  (Bl)  is  equal  to 

J1  *0  PR  -  </W  +J’*o(P-PR) 

-  <J>o<p“pR))(.f  -  ^^rX/  yf0(y)(p(y)  "pR(y»dy-  < 

The  first  two  terms  of  (B2)  add  to  zero.  If  p  and  p  differ  only  on  0  the 

K 

third  and  fourth  terms  of  (B2)  are  both  zero  and  the  final  term  of  (B2) 
becomes 


a  • 

Kpr)k[J*  |y|PR<y>dy  -  !  |y|p(y)dy] 


Theorem  A  follows  from  (B3). 
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Footnotes 


1.  Recall  that  the  local  power  slope  Is  the  criterion  for  designing  locally 
most  powerful  detection  systems  (see  Capon  [6]  or  Ferguson  [71). 

2.  Note  that  the  sample  mean  can  be  replaced  by  any  other  consistent  estimate 

2 

of  8  in  this  analysis  (see  [181) »  In  which  case  the  condition  o  <  • 
might  be  relaxed  in  Theorem  1. 

3.  As  in  [11,  by  O(p^)  we  mean  lim  |0(p^)/p^|  <  •. 

p  “•  0 

4.  It  should  be  noted  here  that  V(tQ>PR;0)  is  the  worst-case  value  of  V(f0,p;0) 
over  p  €  9,  but  it  is  not  necessarily  true  that  V(^Q,pRjp)  is  the  worst-case 
value  of  V(fQ,p;p)  over  9  for  p  +  0.  In  fact,  this  latter  situation  is 

not  usually  the  case,  as  follows  from  Lemma  1  and  Theorem  4. 
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Influence  curve  for  robust  M-detection  in  dependent  contasdnated 
Gaussian  noise  with  «  -  0.1  and  p  *  0.1. 


i 


@ 


I 


.  r  •  *  •  * 

Jta&uls 


.*  >  /• 


